Personal speech coding

نویسندگان

  • Wenhui Jin
  • Wai-Yip Chan
چکیده

In existing speech coding systems, all quantizer codebooks are designed to suit the statistical and perceptual characteristics of speech signals of a population of speakers. However, an individual’s speech signal does not exhibit, even over a long time, the entire range of characteristics of the population. With the advent of the personal communication systems, personal information might become available and be used to improve the rate-distortion performance of speech coders. In this paper we assess the potential gain of personal speech coding by designing codebooks for individual speakers. Spectral quantization, excitation and pitch lag codebooks of existing CELP coders are redesigned. The gains appear to be modest, suggesting that we need to use a different coding framework, which can model personal characteristics explicitly. Amongst the components, the spectral quantiser seems to be most amenable to personalisation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Recognition Using Lpc and Hmm Applied for Controlling Movement of Mobile Robot

This paper describes about a project of speech recognition, which is applied on a mobile robot for controlling movement of the robot. The methods that used in this project are Linear Predictive Coding (LPC) and Hidden Markov Model (HMM). LPC method is used to extract data of speech, on the other hand HMM is used to recognize the unknown speech pattern. This system is implemented on personal com...

متن کامل

MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music

NTT DOCOMO Technical Journal Vol. 13 No. 3 ©2011 NTT DOCOMO, INC. Copies of articles may be reproduced only for personal, noncommercial use, provided that the name NTT DOCOMO Technical Journal, the name(s) of the author(s), the title and date of the article appear in the copies. *1 ISO: An organization for standardization in the information technology. Sets international standards for all indus...

متن کامل

A Bengali Speech Synthesizer on Android OS

Different Bengali TTS systems are already available on a resourceful platform such as a personal computer. However, porting these systems to a resource limited device such as a mobile phone is not an easy task. Practical aspects including application size and processing time have to be concerned. This paper describes the implementation of a Bengali speech synthesizer on a mobile device. For spe...

متن کامل

Objective Speech Quality Evaluation. A primarily Experiments on a Various Age and Gender Speakers Corpus

The present work was carried out to investigate the relation between a speaker’s age and the quality of the encoded speech for several, well known speech coding techniques and a commercially available vocoder CMX618. For this purpose, a corpus of speech records obtained from speakers of various age and gender groups was created. The PESQ (Perceptual Evaluation of Speech quality) method was used...

متن کامل

FBG Model Based Low Rate Coding of Speech

represented by a set of fbg filters each corresponding to a resonance peak where f is the formant frequency, b the 3 dB bandwidth and g the gain of a resonant peak. Speech coding algorithms are basic components in existing and future personal communication systems. Reducing the bit rate in transmitting speech is by way a matter of using slowly varying and numerical robust parameters to represen...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998